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Abstract 

We consider a nonparametric goodness of fit test problem for the drift 
coefficient of one-dimensional small diffusions. Our test is based on discrete 
observation of the processes, and the diffusion coefficient is a nuisance func- 
tion which is estimated in our testing procedure. We prove that the limit 
distribution of our test is the supremum of the standard Brownian motion, 
and thus our test is asymptotically distribution free. We also show that our 
test is consistent under any fixed alternatives. 
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1 Introduction 



Goodness of fit tests play an important role in theoretical and applied statistics, 
and the study for them has a long history. Such tests are really useful especially if 
they are distribution free, in the sense that their distributions do not depend on the 
underlying model. The origin goes back to the Kolmogorov-Smirnov and Cramer- 
von Mises tests in the i.i.d. case, established early in the 20th century, and they are 
asymptotically distribution free. On the other hand, the diffusion process models 
have been paid much attention because they are useful in many applications such 
as Biology, Medicine, Physics and Financial Mathematics. However, the problem 
of goodness of fit tests for diffusion processes has still been a new issue in recent 
years. Kutoyants P] considered this problem in his Section 5.4, but his tests are 
not asymptotically distribution free. Dachian and Kutoyants [1] and Negri and 
Nishiyama [6] proposed some asymptotically distribution free tests. However, all 
their results are based on continuous time observation of the diffusion processes. 
The main contribution of the present paper is that our test is based on discrete 
time observation, which is more realistic in applications. 

Consider a one-dimensional stochastic differential equation (SDE) 



where S and a are functions which satisfy some properties described in Sec- 
tion 2, and t ^ Wt is a. standard Wiener process defined on a stochastic basis 
{Q, J^, {J^t)te[o,T], P)- Here, T > is a fixed time. We consider a case where a 
unique strong solution X to this SDE exists, and we will consider the asymptotic 
as £ I 0. Statistical inference for this model based on continuous observation was 
studied by Kutoyants [3]. As for discrete observation cases, many researchers have 
treated the model in some parametric settings; see e.g. S0rensen and Uchida [8] 
and references therein. In this paper, we are interested in nonparametric goodness 
of fit test for the drift coefficient 5", while the diffusion coefficient cr^ is an unknown 
nuisance function which we estimate in our testing procedure. That is, we consider 
the problem of testing the hypothesis Hq : S = So versus Hi : S ^ Sq for a given 
Sq. The meaning of the alternatives "S" 7^ Sq" will be precisely stated in Section 4. 
We consider the following situation. 

Sampling Scheme. The process X = {Xt;t G [0,T]} is observed at times = 
tl < tl < ■ ■ ■ < tLj) = T, such that hs = 0(5^) as e J, 0, where hs = maxi<j<„(e) |tf — 



We may assume e < 1 and < 1 without loss of generality. We will propose 
an asymptotically distribution free test based on this sampling scheme. 

The organization of the article is as follows. In Section 2, we state some con- 
ditions for [S, a) which are assumed throughout this work. Section 3 gives the 
main result under the null hypothesis, assuming the existence of a consistent es- 
timator for the limit variance. In Section 4, we prove that our test is consistent 
under any fixed alternatives, assuming the existence of a consistent estimator for 
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the limit variance again. A consistent estimator for the hmit variance is exphcitly 
constructed in Section 5. The proofs for lemmas and a theorem in Section 5 will 
be given in Section 6, with help from the Appendix. 



2 Preliminaries 

Let us list some conditions for the pair of functions {S, a). 
Al. There exists a constant C > such that 

\Six) - Siy)\ < C\x - yl - < C\x - y\. 

A2. sup^gjo.T] ^l-'^sP < oo. O 

Under Al, the SDE ([1]) has a unique strong solution X, and notice also that 
there exists a constant C" > such that 

\Six)\ < C"(l + |x|), |(t(x)| < C'il + \x\). 

To see this, just put y = 0. The constant C depends on the values S{0) and ct(0), 
however the constant C itself depends on the choice of the functions {S,a). So it 
is convenient to introduce the notation 

Ks,a = max{C, C'}. 

Let us fix some more notations. For given S, let us denote by x^ = {xf; t G 
[0,T]} the solution to the ordinary differential equation 

= S{xf) with the initial value Xq = Xq. 

A3. := ^Jj^a{xfydt > 0. O 

Let us close this section with making some conventions. We denote by C[0, T] 
the space of continuous functions on [0,T], and by £°°[0,r] the space of bounded 
functions on [0, T]. We equip both the spaces with the uniform metric. We denote 
by "^P" and the convergence in probability and in distribution as 5 J, 0, 

respectively. The notation "— always means that we take the limit as e | 0. 

3 Asymptotically distribution free test 

Throughout all this section, we shall suppose that Al - A3 are satisfied for some 
{So, a). 

Our test statistics is based on the random field W = {U^{u);u & [Oi ^]} defined 

by 

n{e) 

U%u) = e-' J2 Mo,u]mXn - ^tf_, - So{X,.J\tt - tu\]. 
1=1 



We will approximate by the following random fields = {V^{u);u G [0,T]} 
and = {M^; u G [0, T]}, defined respectively by: 

i=i 

Ml = e-' I [dX,-S^{X,)ds]. 
Jo 

We present some lemmas which will be proved in Section 6. 
Lemma 1 sup„g[o^T] {U'^iu) — V^^('u)| 0. 
Lemma 2 sup^gfo ^] \ V%u) - 0. 

Lemma 3 G in C[0,T], where G = {G{u);u G [0,T]} is a Brownian 

motions with variance 



/u 
aix^'^fdt. 
-oo 



Combining these lemmas, we obtain the following result. 
Theorem 4 U"" G m ^°^[0,T], where G is the process appearing in Lemma{3 

By the continuous mapping theorem, we have the following. 
Corollary 5 It holds that 



sup \U%u)\ ^ sup \Bt\= T^so-a sup \B, 
u€[0,T] t(^[o,^oJ ^ <e[o,i] 



where t ^ Bt is a standard Brownian motion, and the notation '—^ " means that 
the distributions are the same. 

So we have the main result of the paper. 

Theorem 6 Under Hq : S = Sq, suppose that is a consistent estimator for 
T,So,a- Then we have 

sup„g[o,r]l^^(^)l , 

> sup \nt\, 

te[o,i] 

where t^Bt is a standard Brownian motion. 

The construction of a consistent estimator for Y^s^a will be discussed in 
Section 5. 
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4 Consistency of the test 

Let So be that in Section 3. We denote by S the class of functions S satisfying Al 
- A3 and 

[ \s{xf) - So{xf))dt ^ for some us G [0, T]. (2) 
Jo 

The precise description of our problem is testing the null hypothesis Hq : S = Sq 
versus the alternatives Hi : S & S . 

We will prove that our test is consistent. Fix S E S. We can write If^ = Ug+U^^ 
where 

n{e) 

m^) = e-' Yl MoMm^n - - s{x,.J\tt - till] 

1=1 

and 

n{e) 

Uliu) = e-' l[o,.](tD(^(^t|_J - SoiX,.jm - till 

i=l 

Now we have 

sup |f/^('u)| > sup \U^{u)\ — sup \Ug{u)\. 

u€[0,T] «G[0,T] «e[0,T] 

Since 5" satisfies Al - A3, by the same argument as in Section 3, the random field 
f7| converges to the corresponding Gaussian random field with Sq replaced by S. 
So the second term of the right hand side is Op (1). As for the first term of the 
right hand side, we have the following claim. 

Lemma 7 Choose us G [0,T] as in ([^. Then it holds that \U^{us)\ ^ OpiV). 
We therefore obtain the consistency of the test. 

Theorem 8 Suppose thatT,'^ is a consistent estimator for Y^s^a ■ Under Hi : S E S, 
it holds that 

sup„g[o,T] \U'{u)\ 

^ T <^PUj- 



5 Consistent estimator for E 



S,a 



In order to construct an asymptotically distribution free test, we need a consistent 
estimator for S^o-- The following result gives us an answer. 

Theorem 9 For any {S,a) which satisfies Al and sup^g^ < oo (which 
is stronger than A2 ), 



is a consistent estimator for T^sa 



\ 



n(e) 



1=1 
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6 Proofs 



Proof of Lemma [II Without loss of generality, we may assume that e < 1 and 



/ig < 1 . It follows from Lemma [12] that 



E I sup \U'{u) - V'{u) 
ue[o,T] 



< e 



< e 



n{e) f.e 



1=1 H-i 



I Ks„.E\Xt^_^ - X,\ds 

i=i 



< e-^TKs„.Cihl'^ 
0. 

So we have the assertion of the lemma. 
Proof of Lemma [H Notice that 



1 V / l[o,„](s)[rfX,-S'o(X,)ds] 

i=i Jn-i 

' / [dXs - SoiXs)ds] \/ue[tU,tl 



V'(u)+e 



and that = V^lT). Now we have 



E 



sup \V%u)-M^\ 
ue[o,T] 



4 nie) 

= y^E sup \V%u)-Ml 



£|4 



n(e) 

< >^ sup 



It follows from Burkholder-Davis-Gundy's inequality (see e.g. Theorem 26.12 of 
Kallenberg [2]) that, for a constant depending only on A; = 4, the right hand 
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side is bounded by 

n{e) 
i=l 



a 



{Xs)ds 



n{e) / tl ft- ^ 

< Ci^El Ids- a{Xsfds 

i=i ytt-i , 



< C4T max 



l<i<n(e) Jxe 

< c^Th^ sup Ea{X. 
^ 0. 



Ea{X,fds 



SG[0,T] 



The proof is finished. 

Proof of Lemma\^ When = 5*0, it holds that 



(y{Xs)dWs 



We will apply the central limit theorem for continuous martingales. 

{M^')u = r (T{X,fds 
Jo 

= {a{Xsf - a{xff)ds + r (y{x^>?ds 

= ih + iii)- 

Now, using Lemma fTOl we have 

|(/)| < r \a{X,f - a{xf^f\ds 
Jo 

Jo 



Sol 



< Kso,a sup \Xt-X\ 
te[o,T] 



< Kso,aexp{Kso,aT) ■ e sup 

<e[o,T] 

= Opie). 



\a{X,) + a{xf')\ds 



□ 



(r{X,)dWs 



[ |a(X,) + a(xf°)|cis 
Jo 



So we have (M^)„ a{x^°)'^ds, and the weak convergence of the process 

u ^ holds. 

Proof of Lemma^ We simply denote u = us- We consider the following random 
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variables: 













n[e) 

= X]l[0,n](^i) 
1=1 


Jn-1 


)-SoiX,sJ)ds 


A\ 


n(e) 

= X]l[0,«](^i) 
i=l 




- So{Xs))ds; 




= E / ' i[o. 

i=i Jn-i 




So{Xs))ds; 


^4 


Jo 


-So{x',))ds. 





First, it holds that 



E\Al-Al\ < Y.^ { 

i=l J^-1 



\S{X,.J - S{X,)\ + \S,{X,.J - So{X,)\^ds 

n{e) 



0, 

where Ci is a constant appearing in Lemma fT2l So we have 1^41 — 74|| — 0. 
Next, 

^2-^3 = E / ' (l[o.-l(^0-l[o,H(^)){'5ra-'5ora}o!s 

= [ {SiXs) - So{Xs)}ds ^ue[tU,tl). 
JtU 

If u = Us = T, then A2 = A^. Since 

/■tf 

max / {|S'(X,)| + |S'oM|}rfs 

l<i<n(e) J^e 

< he sup {\S{Xs)\ + |5o(^.)|} = Op(/l,), 

se[o,T] 

we have - A|| 0. 
Finally, notice that 

Al -A,= HsiXs) - So{X,))ds - /V(a;f ) " Soix^,))ds. 
Jo Jo 
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It follows from Lemma [10] that 



/ \SiXs) - Six',)\ds < / Ks,a\X,-x':\ds 
Jo Jo 



< TKs,. sup \Xt-xf\ 

ie[o,T] 

< TKs,a ■ exp{Ks,aT) ■ e sup 

*e[o,T] 

= Opie). 



By the same way, it holds that 
/ \SoiXs)-So{x^,)\ds<TKs„.-exp{Ks,.T)-e sup [ a{Xs)dWs 

Jo t£\Q,T] Jo 



Op(e). 



Thus we have |y4| — A^l —>■ 0. 

Consequently, we obtain — >^ ^ 0, which implies that \U%^{us)\ 7^ Op(l). 

□ 

Proof of Theorem\^ By Ito's formula, we have 



- \Xt._X = 2 / XsdXs + e' / a{X,Yds 



Since 



n{e) 



1=1 



it is enough to show that 



and 



i^s-X,. JdX.^^O 

i=i Jn-i 

Jo 



The latter is proved by the same argument as that in the proof of Lemma [31 As 
for the former, observe that 



r {x,-x,. yx, 

i=i 

< e-^Yl \Xs-X,.J\S{X,)\ds + e-' {X, - X,.Ja{Xs)dWs 



i=i " i-i-i 



i=i i-i-i 
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By Lemma [121 the expectation of the first term on the right hand side is 

n(e) 

e-'Y. / " E{\X,-X,.J\S{X,)\)ds 
i=i -^^1-1 

"(^) ft! . ^ 

< yjE\X,-Xt.J'^^E\S{X,)\'ds 
i=i -^^i-i 

< e-^y^ VC2max{/i2,£2/iJ^^|5(X,)|2ds 

i=i Jn^i 

< E-^T^2^^^{he,ehy^} ■ sup ^E\S{X,)\^ 

se[o,T] 

0. 

On the other hand, the expectation of the square of the second term on the right 
hand side is 

"(e) ^t? 
s-'Te I \X,-X,. fa{X,)'ds 

i=i ^n-i 

n{e) t1 

< e-^Y. ^E\X,-Xt^J^^Ea{X,Yds 

1=1 "'*f-i 

< e-^T^dhl sup ^Ea{X,Y 

se[o,T] 

^ 0. 

This proves the consistency of our estimator. □ 

Appendix 

In the main part of this article, we use the following inequality which is well known. 

Lemma 10 For any solution X = {Xt;t E [0,T]} to the SDE (Qp, it holds that 

sup \Xt — Xt\ < exp{Ks^aT) ■ e sup 

te[o,T] ' te[o,T] 

Proof. The proof is a simple application of Gronwall's inequality: apply Lemma 
4.13 of Liptser and Shiryaev [5j for cq = esup^gjg.T] 

C2 = 0, u{t) = \Xt — Xt\, and v(t) = 1. □ 

The following fact is used in the article many times, so we state it as a lemma 
here. 



/ a{X,)dWs . 
Jo 
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Lemma 11 Let f : —>■ M. be a measurable function such that 1/(2;) | < H{1 + \x\) 
for some H > 0. Let X = {Xf, t G [0, T]} be any stochastic process. Let k > and 
assume sup^gjQ j.] E\Xt\'' < 00. Then, it holds that sup^g^ j.] E\f{Xt)\'' < 00. 

Proof. Since 

\x + < \\x\ + \y\\'' < |2max{|a:|, \y\}\'' < 2''{\x\'' + \y\''}, 

the lemma is trivial. □ 
The following lemma is rather well known, but we give a full proof for references. 



Lemma 12 Let X = {Xt, t E [0, T]} be a solution to the SDE (Qp for {S, a) which 
satisfies Al. Let k > and assume supjgjo,T] E\Xt\'''^'^ < 00. Then, there exists a 
constant Ck > 0, such that for any <t <t' <T and any e > 



E\Xt,-Xt\'' < Ckmax{\t' -t\\e^\t' -t\''/^}. 



In particular, if \t' ~ t\ < 1 and e < 1, then 

ElXf — XA'^ < Ck\t' — t 



k/2 



Remark. The constant Ck is not a universal constant depending only on k. It 
actually depends on S, a, T. However, it does not depend on t, t', e. 

Proof. First we consider the case k >2. Notice that 

-Xt= [ SiXs)ds + e I a{Xs)dWs. 
Jt Jt 



It follows from Holder's inequality that 

k 



t' 



S{Xs)ds 



\SiXs)\''ds. 



Taking the expectation, it holds that 

k 



E 



t' 



S{Xs)ds 



< \t' -t\'' sup E\S{Xs)\''. 

se[o,T] 



On the other hand, it follows from Burkholder-Davis-Gundy's inequality that 
there exists a constant Ck > 0, depending only on k, such that 

k/2 







k 




E 


6 j a{X,)dWs 


< Cks'^E 


y* cx{x,fds 
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When > 2, it follows from Holder's inequality that 



k/2 



Taking the expectation, we have 



a{X,fds 



fc/2 



<e^\t'-t\^'^ sup E\a{Xs)\^. 
se[o,T] 



When k — 2, we actually have 

e^E f a{Xsfds 



Ea{Xsfds 
< e^\t' -t\ sup Ea{Xsf. 

s&[0,T] 

Thus the proof for the case A; > 2 is finished. 

For k e (0,2), by Jensen's inequality, it holds that 

{E\Xt> - < E\Xt' - < C2 max{|t' - t\\ - 

thus we obtain the desired inequality. 



□ 
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